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Abstract 

This paper ties the line of work on algorithms that find an 0( Vlog n)-approximation to the SPARS- 
EST CUT together with the line of work on algorithms that run in sub-quadratic time by using only 
single-commodity flows. We present an algorithm that simultaneously achieves both goals, flnding an 
0(Ayiog(n)/£)-approximation using 0(n^ log*^'^' n) max-flows. The core of the algorithm is a stronger, 
algorithmic version of Arora et al.'s structure theorem, where we show that matching-chaining argument 
at the heart of their proof can be viewed as an algorithm that finds good augmenting paths in certain 
geometric multicommodity fiow networks. By using that specialized algorithm in place of a black-box 
solver, we are able to solve those instances much more efficiently. 

We also show the cut-matching game framework can not achieve an approximation any better than 
n(log(n)/loglog(n)) without re-routing flow. 



1 Introduction 

We consider the problem of partitioning a graph into relatively independent pieces in the sense that not 
too many edges cross between them. Two concrete optimization problems arising in that context are the 
SPARSEST CUT and BALANCED SEPARATOR problems. We are given an undirected weighted graph G on 
n vertices, where each edge xy has capacity G^y (we identify a graph with its adjacency matrix). The 

edge expansion of a cut {S,S) is h{S) = " ■ The sparsest cut problem is to find a cut {S,S) 

minimizing h{S); we write h{G) to denote the value of such a cut. The balanced separator problem has 
the same objective but the additional constraint that min{|5|, \S\} > il{n). Both problems are NP-hard, so 
we settle for approximation algorithms. 

Most of the original work on graph partitioning focused on achieving the best approximation factor and 
falls into one of two themes. The first is based on multicommodity flow, using the fact that if a graph H of 
known expansion can be routed in G via a feasible flow, then h(H) < h{G). If H is some flxed graph, flnding 
the best possible lower bound is equivalent to solving the maximum concurrent flow problem; i.e., maximizing 
a such that F < G and D > aH where D^y = J2p-x^y fp demand graph and F^y — J2p3xy fp 

flow graph of the underlying flow. By taking H to be the complete graph, Leighton and Rao showed an 
upper bound of h{G) < 0{logn)a* h{H) for the optimal a*, yielding an O(logn) approximation. The other 
theme is the discrete Cheeger's inequality of Alon and Milman,2 characterizing the relationship between 
cuts and the spectrum of a graph's Laplacian matrix. In particular, if G has maximum degree d, then 
A2(£g)/2 < h{G) < y^2d\2{CG), where A2(£g) is second smallest eigenvalue of G's Laplacian. The two 
themes are incomparable, as the latter is a better approximation when G is an expander (i.e., h{G)/d is 
large) while the former is better when G has sparse cuts. 

Arora, Rao, and Vazirani naturally combined the two themes. Rather than embedding a flxed graph 
H of known expansion, they embed an arbitrary H and then certify iJ's expansion via A2(£h)[7^. Since 
M{C,h) > a is equivalent to Ch h -^k, where K is the complete graph, the problem of flnding the best 
such lower-bound can be cast as a semideflnite program: 

d 

maxa s.t. —Cfcdi^D, F<G (1) 
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They showed that for the optimal a*, one has an upper bound of h(G) < 0{\/log n)a* , yielding the cur- 
rently best known approximation factor. Shortly thereafter, Arora, Hazan, and Kale designed a primal-dual 
algorithm to approximately solve ([ij in 0{n^) time using multicommodity flows [3 . 

More recently, researchers have focused on designing efficient algorithms for graph partitioning that beat 
the quadratic multicommodity flow barrier. Khandekar, Rao, and Vazirani designed a simple primal-dual 
framework for constructing such algorithms based on the cut-matching game and showed one could achieve 
an O(log^ri) approximation in that framework using polylog max-flows[Tn]. Arora and Kale designed a 
very general primal-dual framework for approximately solving SDPsfS^. They showed efficient algorithms for 
several problems could be designed in their framework, including an 0(log ri)-approximation to sparsest 
CUT using polylog max-flows. They also showed one could achieve an 0(-y/Iogri)-approximation in their 
framework using multicommodity flows, simplifying the previous algorithm of Orecchia et al. extended 
the cut- matching game framework of [15 to achieve an 0(log n) approximation [20 . They present two slightly 
different algorithms, and remarkably, their second algorithm is the same as Arora and Kale's, even though 
they never explicitly mention any SDP. They also showed a lower bound of ^l{\/\og n) on the approximation 
factor achievable in the cut-matching framework, suggesting the framework might precisely capture the 
limits of current approximation algorithms and posed the question of whether 0(\/logn) could be efficiently 
achieved in that framework. 



1.1 This Paper. 



We tie those two lines of work together by simultaneously achieving the O(VIogn) approximation factors of 
the former with the nearly max-flow running time of the latter. 

Theorem 1.1. For any e E [0(1/ log(n)), there is an algorithm to approximate </ie SPARSEST CUT and 

BALANCED SEPARATOR problems to within a factor of 0{y^log{n) / s) using only O log*^'-"'^^(n)^ max-flows. 

Theorem |1.1| effectively subsumes the results of [21 HB [SI HD], as taking e = 6(l/log(rt)) yields an 
0(log(n)) approximation using polylog max-flows, while any constant e < 1/2 achieves an 0{yJ\og{n)) 
approximation in sub-quadratic 0(m -I- time using the max-flow algorithm of Goldberg and Rao[14;. 
We also show the cut-matching game framework of [15' can not achieve an approximation better than 
ri(log(n)/loglog(n)) without re-routing flow. 

We build heavily on Arora and Kale's work, achieving our improvement by replacing their use of a black- 
box multicommodity flow solver with a specialized one that makes use of the additional structure present 
in the flow instances that arise. We begin in section |2] by reviewing the nature of those flow problems, as 
well as the main ideas behind the algorithms of [31 |5J [^U]. Having clarifled the connection to partitioning, 
we also state our main technical result, theorem |2.3[ In section [3] we describe the details of our algorithm. 



the correctness of which follows immediately from theorem |2.3| The proof of theorem 2_^ appears in section 
|4] Our lower-bound for the cut-matching game is then discussed in |5] and we finish with some concluding 
remarks in section |6l 



2 Expander Flows 

Expander-flow based algorithms all work by approximately solving ([T]), either explicitly as in [3 |S], or 
implicitly as in |15} 120] . by iteratively simulating play of its corresponding two-player zero-sum game. The 
game has two players: the embedding player and the flow player. The embedding player chooses a non-trivial 
embedding V = (vi, . . . , v„) e (M'')" of the vertices of G. The flow player chooses a feasible flow F < G 
supporting demands D with the goal of routing flow between points that are far away in the embedding. 
More precisely, the payoff to the ffow player is: 



HV,D) 



For given demands D, the best response for the embedding player is the one-dimensional embedding given 
by an eigenvector of Cd of eigenvalue X2{Cd), yielding a value of A2(>C_d). On the other hand, for a given 
embedding, the best response for the flow player is a solution to the weighted maximum multicommodity 
flow problem given by 

max^D^yllv^ - Vyip s.t. F<G (2) 

x<y 
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The frameworks of [TSJISIEQ] start with an initial embedding V^, such as all points roughly equidistant. 
On a given iteration t, the algorithm presents V* to the flow player, who must either respond with demands 
Z?* of value <i>(y*,D*) > 1, or a cut C* of expansion at most k, where k is the desired approximation 
factor. In the latter case the algorithm terminates; in the former case, the demands are used to update the 
embedding for the next iteration. The precise update differs among each algorithm, but essentially vertices 
X, y with large 13* y will be squeezed together in the embedding. The analysis of [HI [3 [20] show that after T 
iterations, for sufficiently large T, their adaptive strategies actually played nearly as well as they could have 
in hindsight, in that 

Since averaging T feasible flows yields a feasible flow, after T iterations the graph D = {D^ + • • • + D'^)/T 
with X2{Cd) > has been routed in G. Thus, for a given graph, the algorithm either routes an 51(1)- 
expander-flow in G or else flnds a cut of expansion n. Using a binary search and scaling the edge capacities 
appropriately yields an 0{k) approximation algorithm. 

The embedding can be updated in nearly linear time, and T = 0(log'^'^''(rt)), so the running time of 
such algorithms is dominated by the running time of the flow player. By sparsifying G (using e.g. [Ej), we 
can and shall assume it has m — 0{nlogn) edges. Using Fleischer's multicommodity flow algorithm [T^ as 
a black box, a nearly optimal pair of primal/dual solutions to (|2| can be computed in 0{n^) time. Note 
that ([2]) has demand weights for every pair of vertices, so D,{n^) space is required to even explicitly write it 
down. On the other hand, each v^; e ^^(logri)^ ^^le weights ||vj; — Vyip are all implicitly stored in only 
0(n log n) space. Therefore, making use of the additional geometric structure of these instances is crucial to 
achieving sub-quadratic time. Implicit in all of [r5[[51l2nj is a specialized algorithm to approximately solve 
([2]). The actual algorithm used is the same in all three, and those algorithms differ only in their strategy for 
the embedding player. 

In the next two subsections, we briefly sketch the single-commodity and multicommodity flow based 
algorithms of [S], and then describe how we tie the two together. In particular, our algorithm is essentially 
an "algorithmetization" of the multicommodity flow algorithm's analysis. For the rest of the section, suppose 
we have an embedding V with ^^^y \\^x — ^yW^ — n-^i and let us further assume that the points are unique 
and ||v3;|| < 1 for all x; i.e., the diameter is not much more than the average distance. 

2.1 Using Single-Commodity Flows 

Consider flrst the absolute simplest case, where d = 1 and the points are simply numbers in [—1,1]. It is 
easy to see that since the points are in [—1, 1], unique, and have average squared-distance f2(l), there must 
be some interval [a, b], where b — a — 51(1) =: a and the set of points to the left of a, yl = {x : v^, < a} and 
to the right oi b, B = {y : Vy > b} have \A\ — \B\ — ^l{n) —: 2cn. A natural way to try to push flow far 
along this line would be to shrink A and B down to single vertices and then compute a max-flow from A to 
B. 

FlowAndCut(K, c,Wi, . . . , Wn G M): 

• Sort {wx}, let A be the 2cn nodes x with least Wx and B be those with greatest Wy. 

• Add two vertices s,t. Connect s to each x E A and t to each y E B with edges of capacity k. 

• Output the max-flow/min-cut for s ~ t. 



Consider invoking FlowAndCut(K, c, Vi, . . . , v„) with k = c^^a^^. If the max-flow is at least Kcn, then 
since all flow must cross the gap [a,b], we have pushed Kcn units of flow across a squared-distance of ct^, 
achieving a solution D with $(y, D) > {Kcna^^jn — 1. Otherwise, if the min-cut is at most nan, then at 
most cn of the added K-capacity edges are cut, so at least cn vertices must remain on each side and the cut 
has expansion at most k. That is, for dimension one a k = 0(1) approximation is obtained. 

The approach of [5j [20] is to reduce the general case to the one-dimensional case by picking a random 
standard normal vector u and projecting each Vj; along u, yielding the 1-dimensional embedding Wx = v^; -u. 
The fact that the points are in the unit ball and have average distance 51(1) implies that with probability 
51(1), there is a gap [a, &] with b—a = ^{l) = a as before. Applying the previous analysis, we either find a cut 
of expansion 0(1) or a flow with X]a;<i/ ^xyiwx ~ Wy)'^ > n. Then, the Gaussian tail ensures that distances 
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could not have been stretched too much along u: with high probability {wx — WyY < 0(logn)||vj; — Vj,|p 
for every pair x,y. Thus, ^(V,D) > ri(l/log(n)), yielding an O(logn) approximation. 

2.2 Using Multi-Commodity Flows 

Arora, Rao, and Vazirani showed that, if a best response D* to V has ^{V,D*) < 1, one can find a cut of 
expansion 0(\/log n). Supposing the optimal solution to Q has value at most n, there must be a solution to 
the dual problem of value at most n. The dual assigns lengths {we} to the edges of G, aiming to minimize 
J2e^eWe subjcct to the Constraints that the shortest-path distances between each x,y under {we} are at 
least ||vj. — Vj^lp. Arora and Kale show the existence of such a dual solution implies that projecting the 
points along a random u and running FlowAndCutwith k G ( \/log n) must yield a cut of capacity at most 
Kcn with probability ri(l). 

If not, then a flow of value at least Kcn is returned for 0(1) of the directions u along which A and B 
are cr-separated. For simplicity, assume that the flows actually correspond to a matching between A and B. 
That is, each x has either zero flow leaving, or else has exactly k flow going to a unique y along a single path. 
On the one hand, that matching is routed in G along cn flowpaths, each carrying flow n. On the other hand, 
the total volume of G is only GgWe — n, so il,{n) of those flowpaths must have length at most 0(1/k) 
under {we}- 

For each u, let M(u) be the matching consisting of those demand pairs routed along such short paths. 
Then, according to the following definition, M is an (ri(l), ri(l))-matching-cover. 

Definition 2.1. A (a, 5) -matching- cover for an embedding {^x} is a collection {Af(u)}ug]gd of directed 
matchings satisfying the following conditions. 

• Stretch: (vj, — v^.) • u > cr for all {x,y) G AI (u) 

• Skew- symmetry: {x,y) G M(u) ifj(y,x) G M(— u) 

• Largeness: Eu [|Af(u)|] > 6n 

For a list of vectors Ui, . . . , Ur, let Af (ui, . . . , Ufj) denote the graph that contains edge (x, y) iff there exist 
Xq, . . . , xn with xq = X, Xfi = y and (x^-i, Xr) € M(uj,) for all r < R. For the empty list, let M() denote 
the graph where each vertex has a directed self-loop. Note that A/(ui, . . . ,Ui^) is not a matching, but rather 
a graph with maximum in-degree and out-degree one. 

Furthermore, M has the property that for each edge {x,y) G M{u), the distance between x and y under 
{we} is at most 0{1/k). The following theorem holds for M. 

Theorem 2.2 f [16j. refining j7]). Let M be a -matching- cover for {'Vx}- Then, there are vertices 

x, y and Ui, . . . ^ur where R < 0{^/log n) such that (x, y) G M (ui, . . . , vlr) and ||v^ — Vy|p > L. 

In other words, there are vertices x,y with ||vj; — Vj,|p > L that are only R matching hops away in Af. 



Applying theorem 2.2 there are vertices x, y with Hv^, — Vj,|p > L but of distance only RO{1/k) under {we}. 
Choosing k = 0{R/L) = 0{^log n) yields a contradiction to the assumption that {wg} is dual feasible. 

2.3 Results. 

Our improvement comes from being able to achieve an 0(^log n) gap between cut and flow solutions, as 
in the latter case, while still only using single-commodity flows, as in the former case. Recall the case of 
d > 1 was reduced to the d = 1 case by projecting along a random vector and bounding the squared-stretch 
by O(logn). Indeed, the stretch could be nearly that much, so simply pushing flow along a single direction 
will not allow us to achieve anything better; in fact, that is the main idea behind our lower bound for the 
cut-matching game. 

To do better, we need to do something more sophisticated than simply push flow along a single direction. 
A natural idea is to try to pick several directions Ui, . . . ,u/j, push flow along each of them, and then try 
to glue the flows together to actually push flow far away globally. One motivation for such an approach is 
that it seems to be the next simplest thing to do, following that of using only a single direction. The second 
and most crucial motivation is to observe that such an approach is strongly suggested by the analysis for 
the multicommodity flow algorithm just sketched. To see that, suppose the typical flowpath along a random 



u routes between points of squared-distance A. Theorem 2.2 says we can always augment R = 0{^/log n) 
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such flowpaths to route demand between points of squared-distance L — at the cost of possibly raising 

congestion by a factor of R. Thus, either A > L/R = y/logn), or else augmenting together R typical 
flowpaths and scaling down by R maintains feasibility and increases the objective of ^ . 

Unfortunately, theorem |2.2| doesn't say anything at all about finding such directions u, or whether the 
same Ui,...,Ufl will simultaneously work for many vertices. To analyze such an algorithm, we need a 
stronger, algorithmic version of theorem |2.2[ Our main technical contribution is such a theorem. 



Theorem 2.3. For any I < R< e{y/log{n)), there is L > Q{R^/\og{n)) and an (efficiently sample-able) 
distribution T) over {W^)~^ with the following property. 

If M is an (fl{l),fl{l))-matching-cover for {v^;}, then the expected number of edges {x,y) with {x,y) £ 
M{T>) and ||v^ - v^p > L is at least e'^'^^^'^n. 



Using theorem 2.3 and choosing R = Q{^J~F\og{n))^ L = 6(e), we simply sample Ui, . . . ,Ufl from V, 
and let our final flowpaths be the concatenation of those along Ui, . . . , u/j. On average, we get n^~^ such 
paths, and thus have simultaneously routed n}^^ paths between points of squared distance L using only 
R single-commodity flows. Using an iterative re-weighing scheme and repeating ©(n*^ log*^^^^ n) times, we 
achieve a feasible flow and an approximation ratio of 0{R/L) ~ 0(-\/log(n)/e). 

That is, to push flow far away, we sample Ui, . . . ,uji from V and then iteratively push flow along each 
direction. The distribution V will essentially consist of picking a random direction Ui, and then choosing 
Ur+i to be a 1 — l/_R-correlated copy of u^; i.e., a vector extremely close to u^. Because u^+i and are so 
close, it is intuitively clear and easy to argue that if flow gets pushed along at each step, it must be pushed 
far away, as the projections along each u,. will essentially add together. The somewhat counterintuitive fact 
is that flow actually does get pushed further along in this manner. Even though and u,.+i are extremely 
close together, a significant fraction of vertices that were in the "sink set" along u,. will be in the "source 
set" along u,.+i. That phenomenon is a consequence of measure concentration. 



3 The Algorithm 

While we found it most convenient to discuss expander flows and the corresponding game in the context 
of the SPARSEST CUT problem, our algorithm applies most directly to balanced separator, which has a 
similar SDP relaxation and game. Roughly, the difference is that in the balanced separator case the 
embedding player must choose an embedding for which the maximum squared distance between points is not 
much larger than the average. When the average distance is 0(1), this is equivalent to the requirement that 
||vx|| < 0(1) assumed earlier in section [2] The reduction from sparsest cut to balanced separator 
is well-known, and in fact, the unbalanced case is "easy" in the sense that if the cut found is unbalanced, 
it will be an 0(1) approximation to the sparsest cut[7]. In particular, Arora and Kale show that one can 
either obtain an 0(1) cut/flow gap with a single max-flow, or else reduce the problem to the balanced case 
by finding fl(n) points in a ball of radius 0(1) that are still spread-out within that ball; for details, we refer 
the reader to 

The precise statement of the results sketched in section [2] is the following main lemma of |5]. 

Lemma 3.1 ([5 ). Let U C [n] be a set of nodes. Suppose we are given vectors V = {vxyxeu of length at 
most 0(1) such that y^jj \\vx — Vy|p = . 

• There is an algorithm that uses 0(1) expected max-flow computations and outputs either a demand 
graph D on U of max-degree 0(log(n)) that is routable in G with <&(V, D) > \ or a balanced cut of 
expansion O(logn). 

• There is an algorithm that uses a single multicommodity flow computation and 0(1) expected max-flow 
computations and outputs either a demand graph D on U of max-degree 0(1) that is routable in G with 
$(y,D) >\ or a balanced cut of expansion 0(-\/Iogn). 

The importance of the degree is for the running time; if each Z?* has max-degree /3, then the total number 
of iterations needed is 0(/31og(n))[5j. To prove theorem |1.1[ we replace lemma 3.1 with the following. 



Lemma 3.2. Let U, V be as in lemma 3.1 For any e g [0(1/ log(n)), f2(l)], there is an algorithm that uses 
Oijf \og^''^\n)) expected max-flow computations and outputs either a demand graph D on U of max-degree 
0(l/e) routable in G with '^{V^D) >\ or a balanced cut of expansion 0{yJ\og{n) /e) . 
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For the rest of this section, we prove lenima [3T2| We first immediately try to find a cut, using FlowAndCut. 
The parameters c, a are set by the following lemma. 



Lemma 3.3 ([7]). Let U, V be as in lemma \37l\ Then, there exist c,a,^ = so for a random u, with 
probability at least 7 the sets A, B in FlowAndCut{- , c, {v^ -ujajgc/) have (vj, — v^^) ■u> a for all x € A, y € B . 



Let us call the u described by lemma 3.3 good, and set S = 7C/I6. Let e £ [0(l/log(n)), $7(1)] be given 



so that R = 0{\/e log n) yields an expected size bound of v} ^ in theorem 2.3 Set L = 0(e) as in theorem 
2.3 K = 24R/cL, and /3 = 12/cL. The following easy lemma was sketched in section [2] 



Lemma 3.4 f |15l [5]). If FlowAndCut(K,c, . . .) returns a cut of capacity at most ncn, then the cut is cn- 

balanced and has expansion at most k. 

We sample 0(log(rt)) independent u, and run FlowAndCut(K, c, {v^ ■ u}). If we ever find a cut of capacity 
at most Kcn, we immediately output it and stop, yielding a balanced cut of expansion k — 0{y^log(n)/e). 
Otherwise, with very high probability, we are in the situation where there are at least 7/2 good u for which 
a flow of value at least Kcn is returned. In the latter scenario, we will find a flow D with ^{V, D) > 1. 

3.1 Finding a Flow 

We efficiently find a solution to the maximum multicommodity flow problem 
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'y\\ 



maxy^ Da;^||va. - Vy 

x<y (3) 
s.t. F < G, maxdeg^(x) < (3 

X 

of value at least n. The dual assigns lengths {we} to edges and {wx} to the vertices, with the constraint 
that the shortest path distance from x to y under these lengths dominate ||va; — Vj,|p. 

min ^ GeWe + ^ fHwx 

e X 

S.t. Vp : X ^ y Wx + Wy + y^^We > \\vx - Vyll^ 

We use the multiplicative weights framework to approximately solve (|3|. 

Theorem 3.5 ([1TJ[T31I1]). Let A E M™^", b E M™ with b > 0, and consider the following iterative procedure 
to find an approximate solution to Ax < b. 

Initialize y^ £ M™ to the all-Is vector. On iteration t, query an oracle that returns x* such that < 
Ax* < pb and y* • Ax^ l£ y^ ■ b, and then update 

If <ri < 1/2, then after T = pr/-'^ log(n) iterations, A + < (1 + 477)6. 



We use theorem 3.5 with rj = 1/4, initializing the dual variables {we}, {wx} and updating them accord- 
ingly. On iteration t, we flnd a flow {F*,D^) of objective value 2n that violates the constraints by at most 
a factor of p = 0{n^^ log n) and 

WeFl + Ywx deg£,t (a;) < ^ WeG^ + ^ Wxl3 (4) 

ex ex 

After T ~ 0{v?'^ {n)) rounds, scaling the average flow down by 2 yields a feasible flow of objective 
value n. Noting that ^ and the algorithm of theorem 3.5 are invariant to scaling of the dual variables. 



for convenience we will also scale them on each iteration so that w^Gf. + WxP — 2n. In that case, 
any flow of objective value 2?! that only routes along violated or tight paths (those p : x y for which 
X^eep^e + Wx + Wx < \\'Vx — '^y\\^) Satisfies Q. In our algorithm, we will only route flow along paths 
p : X ^ y for which \\vx - VyW^ > L, and Wx,Wy,Y,eep'^<^ - ^Z^- 
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All flows will come from augmenting flows returned by FlowAndCut, where we identify single-commodity 
flows in G U {s,t} with multicommodity flows in G in the obvious way. If F is an acyclic s — t flow in 
G U {s,t}, it is well-known that F can be decomposed into at most m flowpaths. While computing such a 
decomposition could require fl^nm) time, fortunately we need only pseudo-decompose flows in the following 
sense. 

Definition 3.6. If F is an acyclic s — t flow in GU {s,t} with a flow decomposition {{fi,Pi))i<m, then a 
list P = {{fi, Si,ti,£i))i<:m where Pi — s,Si, . . . ,ti,t and X^eep — £i is a pseudo- decomposition of F. 
That is, a pseudo-decomposition is a list containing the amount of flow, second vertex, second-to-last vertex, 
and length of each flowpath. 

The following two lemmas are easy applications of dynamic trees (see [22]). 

Lemma 3.7. Given a flow F onGU {s,t}, a pseudo-decomposition can be computed in 0(rn log n) time. 

Lemma 3.8. Given a flow F on G (J {s, t}, and a desired scaling vector (ai, . . . , am), we can compute the 
flow F' with decomposition {{akfkiPk)} in O(mlogn) time. 



Lemma [3T8] allows us to efficiently cherry-pick "good" flowpaths from the flows returned by FlowAndCut. 
In their analysis, Arora and Kale round the flows returned by FlowAndCut to matchings. We do the 
same, with a small change to ensure doing so does not raise congestion by too much. 



Matching(u) 

Call FlowAndCut (k, c, {v^; • u}) and pseudo-decompose the resulting flow into P. Set M 



Throw away any {fi, Si,ti, £i) e P with (v;^ — Vs.) - u < a, fi < ncn/Am, Wg^ > L/3, Wt- > L/S, or 
> L/3R. 

Greedily match the remaining pairs: iteratively pick (fi, Si, ti, £i) € P, add (si, ti) to M , and remove 
any {fj,Sj,tj,£j) e with {si,ti} n {s-j,tj} ^ 0. 

• Output M. 

The following lemma is essentially the same as one used in ^ , and follows by the choice of parameters. 
The congestion bound, which was not needed for their analysis but is needed for our algorithm, comes from 
the fact that Matching discards any flowpath with fi < ncn/Am before scaling any remaining flows to 1. 

Lemma 3.9. Matching is a [a, S) -matching- cover. Furthermore, for each u, the (unit-weighted) demands 
Matching{u) are simultaneously routable in G with congestion at most Am/ kctl along flowpaths of length at 
most L/SR under {wg}- 

Proof. The symmetry and stretched properties hold by construction, so we need only establish the largeness 
property. Let u be a good direction for which the returned flow has value at least Kcn, and let D be the 
corresponding demands. Since u is good, every demand pair is cr-separated along u. Each x d U has degree 
at most K and the total degree is at least 2Kcn. Deleting each path with fi < kctl /Am removes at most 
Kcn/A total flow. Since J^x ^^/^ — ^^'^ ~ V2/cL, at most cn/A vertices can have > L/3; deleting 
them removes at most ncn/A units of flow. Finally, since the original flow was feasible in the original graph, 

^ fp£p ^^We^ fp <^ WeGe < 2n 
p e p3e e 

Since k = 2AR/cL, at most Kcn/A units can flow along paths longer than L/3R. 

In total, the second step of Matching removes at most Sucn/A units of flow, so at least Kcn/2 total degree 
survives. Each greedy matching step decreases the total degree by at most 4k, so at least cn/8 pairs must 
get matched. Thus, the expected size of Matching(u) is at least {j /2){cn/8) — 6. 

For the congestion bound, we threw away all paths with flow less than ncn/Am, so scaling the remaining 
paths to 1 yields a flow with congestion at most Am/ nan. □ 



On each iteration, we sample Ui, . . . , Uji from the distribution V of theorem 2.3 and call Matching(Ur) 



Let D' be the unit-weighted graph with an edge {x,y) for each {x,y) G Matching(ui, . . . ,uji) with ||v^ 



Vj/P > L. By theorem 2.3 the expected size of D' is at least so after n'^ /2 expected trials, we 
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have \D'\ > n^^^ /2. Applying lemma 3.8 again R times, we can compute a flow F' that routes D' in G 
with congestion R{4m/Kcn) ~ 0{log{n) / L), since m — 0{nlogn) by assumption. Note also that D' has 
max-degree 2. 

Then, D' achieves an objective value of at least |-D'|i, so scaling up by 2n/\D'\L yields a solution of 
value 2n that satisfies Q and congests edges by at most an O(n^logn) factor. Since /? = 12/ci = il(l/e), 
the degree constraints are also violated by at most an O(n^) factor. The running time is dominated by 
flow computations, of which there are an expected 0{Rrf) in each of 0(n^ log^(n)) iterations, for a total of 
0(n^'^ log^''^(n)) expected max-flows. 



4 Proof of Theorem 2.3 



Let Ad be a (tr, (5)-matching cover. We identify M with a weighted directed graph, where edge {x,y) is has 
weight Pru[(a;,y) € M(u)]. The skew-symmetry condition ensures the weights of (x, y) and (?/,a;) are the 
same, as are the in-degree and out-degree of each x. The total out-degree of M is at least 5n by assumption. 
Following [7], we first prune M to a more uniform version by iteratively removing any vertex of out-degree 
less than 5/A. Doing so preserves skew-symmetry, and at least 5n/2 out-degree remains. It follows that we 
are left with a matching cover on vertices X, with \X\ > 6n/2 and every x ^ X has out-degree at least 6/4. 
The pruned M is a (cr, (5/4)-uniform-matching-cover. 

Definition 4.1. A {a, 6) -uniform-matching- cover of X C [n] is a {a, 0) -matching- cover where every 
X ^ X has in-degree at least S in M . 

4.1 Chaining and Measure Concentration 

Let y G X, and let A be the set of u for which y has an out-edge in A/(u). The main idea behind the proof 



of theorem 2.2 is the following. Since A and —A are two sets of measure ri(l), the isoperimetric profile of 
Gaussian space implies there must be many u e A, u e —A that are very close: Iju — u|| < 0(1) (we remark 
that [7} uses the uniform measure on the sphere, but the same analysis holds for Gaussians after scaling 
various quantities by ^/d). Choose x,z with {x,y) G M{u), {y,z) G M{\i) and observe that 

(yy - Vj,) • U = (Vj^ - Vj;) • U - {Wy - Va;) ' ( u " u) 
> - ||V:e - Vj^llllu - U|| 

Thus, either ||v-c — Vj,|| > ^{cr), or else (vj, — v^;) • u > a/2. In the former case, a matching edge joins 
two points of distance ri(l). In the latter case, replacing the edge (?/, z) G M(u) with {x, z) yields an edge 
with (vx — Vx) • u > (3/2)cr. By an inductive argument, the chaining case can be repeated until an edge 
connects two points of distance ri(l). On the one hand, after R chaining steps, we have pairs of points 
that are R matching-hops apart, 0(1) distance apart, and have projection Q{R). On the other hand, with 
high probability, no pair of distance 0(1) has projection Q{^J\og n), so the process must end after 0(-^log n) 
steps. 

To turn the argument into an algorithm, we choose a sequence of highly correlated directions Ui, . . . , u^. 
For R> 1 and < p < 1 , let Afp' be the distribution of Ui , . . . , defined by choosing a standard normal Ui , 
and then choosing each u^+i Uj. to be a /9-correlated copy of u^. That is, each of the d coordinate vectors 
(ui^i, . . . , Ufl.j) are independently distributed as multivariate normals with covariance matrix S^^r' — p^^~^ '. 
In fact, simply setting V — N^_^ii^ achieves theorem 2.3 for R < 0(log^^'^(n)) and size bound of e^'-'^^^^n. 



The barrier is essentially the same as the one that limited the original analysis of [7] to i? = 0(log(n)^/'^). 
To overcome that barrier, we algorithmetize Lee's improvement [TB] by independently sampling uncorrelated 
Wi, . . . , Wfl, and then shuffling the two lists together. The idea is that the highly correlated will give us 
long stretch, while the will greatly increase the probability of forming a long chain, at the cost of losing 
some stretch. The sampling algorithm is: 



Sample(i?, p) 

• Pick ui, . . . , Ufl 7V^, wi, . . . , w/j Af^'. 



• Pick a random shuffling of the two lists, pick a random r < R, and output the first r elements of the 
shuffled list. 
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The reason for the randomness is to keep the algorithm trivial, leaving the work to our analysis. We 
show that there exists a particular shuffling and r < R for which Sample is good; by randomly guessing, we 
lose at most a 2^+^ factor in our final expectation bound, which is negligible relative to the e~'-"^^ bound 
we are aiming for. 

Our proof of theorem |2.3| closely follows Lee's proof of theorem |2.2| the main difference being the use 
of a stronger isoperimetric inequality. The the standard isoperimetric inequality says that if A is a set of 
large measure, then for almost points u, a small ball around u has non-empty intersection with A. We use a 
stronger version, saying that if A is a set of large measure, then for almost all points u, a small ball around 
u has a significantly large intersection with A. 

Lemma 4.2. Let A C M'' have Gaussian measure 6 > 0. //u, u are p-correlated with < p < 1, then 



Pr, 



Pra [u e ^] < (eS) 



< e 



Lemma 4.2 is an easy corollary of Borell's reverse hypercontractive inequality [9 ; we include a short 
proof in appendix |Aj Applications of Borell's result to strong isoperimetric inequalities appear in [48^, and 
we follow the proofs of similar lemmas there. 



4.2 Definitions 

For a matching-cover M and a distribution V over M*, let M{T>) be the random graph A/(ui, . . . , u^) where 
Ui, . . . , Uj. I?. For a random graph Q and sets S,T C [n], let tig{S, T) be the expected number of edges 
from 5 to T in Q. We say S is ^-connected to T in if ^g{S,T) > 7. For singleton sets, we omit braces and 
write fig(x^y) for the probability that the edge {x,y) is in Q. 
Two sets that will be useful are, 

Ba\\[x;i]^{y:\W,-^y\\<i} 
Stretch [a;, a, u] = {y : [vy — v^) • u > cr} 

We will also work with collections of distributions T) = {I?(u)} over M* parameterized by u. Such a collection 
is itself associated with the distribution induced by sampling a standard normal u and then sampling from 
2?(u). 

Definition 4.3. Let V be a distribution collection. We say a vertex x is (a, d,j,i)- covered in M{'D) if for 
least 5 of u, X is '^-connected to Stretch [a;, ct, u] n Ball[x;^] in M(I?(u)). 



4.3 Cover Lemmas 

Our goal is to exhibit a distribution V such that many vertices x are well-connected to X \ Ball[x; in 
M(T>). To do so, we inductively construct particular distribution collections such that many vertices x 
are either e^^^^'^'-connected to X \ Ball[a;;\/Z] in M{V), or else are (f^(r), e"°(''^\ \/L)-covered by 
M{V). 

We begin with a trivial bound on how much points can be covered. 

Lemma 4.4. For ^, 7, 5 > and arbitrary M, V, no vertex is {£.y2log{n/S),S, 7, i) -covered by M{V). 

Proof. For any y S Ball[a;;£], the probability that (vj, — v^;) • u > /3 is at most exp(— /3^/2i'^) < 6/n for 
(3 = £y^2 log(n/(5). It follows that the probability that Stretch[a;, i?^2 log(n/(5), u] n Ball[x; £] is non-empty is 
at most {n — l)S/n < S. □ 

The next lemma says that if a vertex x is connected by V to a set S of vertices that are covered by V, 
then X is covered by the concatenation of V and T). 

Lemma 4.5. Let S be a set of vertices such that each y € S is {a, d,"f,£)- covered by M(T>). Let x be 
a vertex with ^m{v){Xt S C\ Ball[a:;£']) > 7'. Then, x is (cr - v/2f log(2/^), 5/4, 77'(5/4, £ + £')-covered by 
V"{u) = V',Viu). 
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Proof. Let T be the distribution of a;'s out-neighbor in M{'D'), conditioned on 5*0 Ball[x; £']. For each y G S, 
let Ay be the set of u for which y is 7-connected to Stretch[?;. a, u] n Ball[j/; £] in M{'D{u)). 

For any fixed y GT, the quantity (v^; — Vj,) • u is normal with mean zero and variance ||vj, — v^||^ < 
so the probability (over u) that y e Stretch [x, u] is at least 1 — exp(— /3^/2^'^) > 1 — S/2 for (3 = 
log(2/5). Then, for at least 5/2 of u, we have y G Stretch [a;, u] and u e Ay. By averaging, 
for at least 5/4 of u, at least 5/4 of j/ ~ F have y S Stretch[a;, — /3, u] and u € Ay. It follows that x is 
{a - f3, 5/4, 77'5/4, i + f )-covered by V" . □ 

Our next lemma is the main chaining step. 

Lemma 4.6. Let M be a (tJo, ■)-matching-cover, T be a set of vertices that are (cr, 1 — 5/2, 7, oo)-covered in 
M(T>), and S a set of vertices that is 5\T\- connected to T in M. Then, at least 5|r|/2 vertices x Cz S are 
{a + ao,S\T\/A\S\,-f,oo)-covered by X>'(u) = u,X>(u). 

Proof. Let M' be the subgraph of M consisting only of edges from S to T; by assumption the total degree in 
M' is at least 6\T\. Further remove any edge {x, y) e Af'(u) where /iM(X'(u)) (y, Stretch [y, cr, u]) < 7. The total 
in-degree remaining is at least 5|T|/2, so there is a set 5" C S* of at least 5|T|/2 vertices that have out-degree 
at least 5|T|/4|S'|. Finally, note that if {x^y) G M'(u), then Stretch [y, cr, u, cx)] C Stretch [x, cr -f ctq, u, 00], so 
each X e 5" is (cr ctq, 5|T|/4|5|, 7, cx))-covered by V. □ 

To apply lemma [4!6| we need to establish covers with 5 very close to 1. Consider taking a collection T) 
and then smoothing it by replacing I?(u) with the average of I?(u) for nearby u. The next lemma shows 
that doing so boosts 5 to nearly 1, in exchange for a loss in a and 7. 

Lemma 4.7. Let x be (a, 6, j^i)- covered by V. Then, x is (per - 4£^log(2/5), 1 - 25, S'^/'-'^'P^/4,, t)-covered 
by 'T)'{vl) = f (u) where u ~p u. 

Proof. Let A be the set of u for which x is 7-connected to Stretch [x, ct, u] n Ball[x;^] in M(I?(u)). For each 
u, let r(u) be the distribution of x's out-neighbor in A/(I?(u)), conditioned on Stretch [x, cr, u] n Ball[a::;£]. 

For any u and y G r(u), the quantity (vj, — v^;) • u is normal with mean p(vj, — v^) • u > per and 
variance (1 — p^)|ivy — Vj.||^ < 2(1 — p)t^] it follows that y e Stretch[x, per — (3, u] with probability at least 
1 - exp(-/3V4(l - p)^2) > I _ (^/2)4/(i-p) for p = 4€^log(2/5) over u. By averaging, for at least 1 - 5 u, 
for at least 1 - (2/5)(5/2)'*/(i^'') u u, we have Pr [r(u) e Stretch [x, per -/?, u]] > 1/2. Call such pairs 
(u, u) good. 

Applying lemma [4!2| to A, for at least 1 — 5 u, we have Pr[u G A] > 5^/'^^^'''. All together, for at least 
1 - 25 u, with probabihty at least 5^/(1-'') - (2/5)(5/2)''/(i-'') we have both u e A and (u,u) good. In that 
case, pM(X)(ii)(a;, Stretch [x, pa - /3, u] n Ball[x;^]) > 7/2. The lemma follows by noting (2/5)(5/2)^/(^"'') < 

j2/(l-p)/2. □ 

Combining the previous results, we prove the main inductive lemma. 

Lemma 4.8. Let M be a {a, S)-uniform-matching-cover of X where 5 < 1/4. Let £ < cr/2'''-\/log(l/5) and 
K > I. Then, one of the following must occur. 

1. There are distribution collections T)^ , . . . ,T>^ such that for every k < K, at least 5^*^1X1 vertices are 
{ka/A,S^,S^*'^'',£)-covered m M{V^). 

2. There is a distribution D* such that at least 6^^\X\ vertices x are 5^*^ -connected to X \ B[x;£] in 
M{T>*). Furthermore, T>* is a shuffling of M^_^^j^ with Mq for some k < K and k' < 6K. 

Proof. For fc = 0, every x E X is (0, 1, 1, 0)-covered by I?"(u) = (), the empty list. 

Assuming case fll holds for some < k < K,let Tq be those vertices that are (fcer/4, 5^, 7, ^)-covered by 
V'' , where 7 = 5^^ . We begin by finding a set S that is well-connected to Tq. Since at least 5|ro| in-degree 
enters Tq in AI, by averaging either at least 5~^|rol vertices have at least 5^|To|/|X| out-degree into Tq or 
else at least S\Tq\ vertices have at least out-degree into Tq. In the former case, call that set Ti and repeat, 
yielding sets To,Ti, . . . ,Tt where each y G Ts has at least 5^|Ts_i|/|X| out-degree into Ts_i. Let S be those 
vertices with out-degree at least 6^ into Tt, so that S\Tt\ < \S\ < S^^\Tt\. Let V = Mq] by construction. 
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each y e Tt has 



>^2t-t(t-l)/2|j.^|/|^| 
> g3+6k 

Assuming case[2]does not hold by setting V* = V , there is a set T C of size at least (1 — '5^)|T(| such 
that each y eT has HM{V'){y,To n Ball[y;^]) > S^+^''/2 =: 7'. It follows that at least S^\S\ - S^\Tt\ > S^\Tt\ 
out-degree from S enters T. 

implies each y e T is ((fc - 1)ct/4, (5^ 7", 2£)-covered by X>"(u) = V, V''{u) where 7" = -fj'S^ 



Lemma 



4.5 



(we replace factors of 1/4 with (5). Setting p = 1 — l/K, lemma 4.7 implies each y G T is ((fc — 3)(t/4, 1 — 
2J^, Y", 2^)-covered by V"'{u) = X>(u) for u u where 7"' ="^^+17". Finaily, since ^5|Tt|/4|S'| > S\ 

implies at least 6^\Tt\/2 vertices in S are ((A;+ l)cr/4, 5^ 7"', oo)-covered hyV''+^{u) = u,X'"'(u), 



lemma 
where 



4.6 



Assuming case does not hold for V 
1)ct/4, <5^ <524^(^i), £)-covered by P^+i 

Finally, note each Vk consists of a shuffling of ■N'i_i/j^ 
case can occur at most 6/c total times. 



7"' = jl8A'+l+3+6A;^^2 > 2,524A('=+1) 

= P'^+i, at least 5^\Tt\/^ > S^'^''+^'>\X\ vertices in S are ((fc 



with A/'o where fc' < 6fc because the expanding 

□ 



To complete the proof of theorem 2.3 recall M is a (cr, ^/4)-uniform-matching-cover of X. Let 1 < 



R < log(n)/ log(l/(5)). For R < 7, lemma 4.8 implies a typical edge in M has length ri(cr/-y/log(n/(5)) = 
^l{Ra/^/\ogn) since log(n > log(l/(5)) by assumption. That is, setting V = Sample fl, ) suffices. 



case 
suffices 



For R > 7, set K — [i?/7j and £ ~ Ra /2^'^ ^yiog{n) , so that i satisfies lemma 
[1] of le ~ 



3mma 



4.8 



Lemma 



4.4 



implies 



4.8 can not hold for K, so case [2] must hold. That is, setting V — Scmiple(i?, 1 — l/K) 



4.4 Using ±1 Coins 

One might be concerned with issues of precision required for sampling Gaussians. Fortunately, it suffices to 
approximate them by sampling w G {±1}*^ and returning ^ X^iLi ^ ^ O(logn). 

Lemma 4.9. Suppose that instead of a random Gaussian u, we sample a uniform random ±1 matrix 
U € M'^^'^ and set u ~ Ul, where 1 € M'^ has Ij = l/Vk for all j < k. To sample a p-correlated copy u, we 
sample U G M.'^^'^ as a p-correlated copy o/U (i.e., each XJij = XJij with probability p or a random ±1 with 
probability \ — p) and set u = Ul. Then, for k 



0{R^\og{ll5)) 0(log?i), theorem 



2.3 



still holds. 



The proof of lemma |4.9| is straightforward. Lemmas |3.3| |4.4| |4.5| all still hold with similar constants 
1 (see e.g 



even for k 



Pj), so the only issue is lemma 4.7 



For the latter, lemma 4.2 also holds for 
Vy) • (u- 



p-correlated ±1 variables, so the only change needed is in bounding (v^. — Vj^) • (u — u), which is easily done 
for k ~ 0{R^\og{l/5)) = 0(log(n)) using Bernstein's inequality(see e.g. pil,). For completeness, we include 
the details in appendix [B] 



5 Lower-bound for the Cut-Matching Game 

Khandekar, Rao, and Vazirani proposed a primal-dual framework based on the following two-player game 
game, which proceeds for T rounds [T5]. On each round, the cut player chooses a bisection (5*, 5*) of the 
vertices, and the matching player responds with a perfect matching M* pairing each x G 5'* with some y € S* 
The payoff to the cut player is h{H^), where if* — -I- • • • -I- M*. Thus on round t, the cut player aims to 
choose a cut so that any matching response A/* will increase the expansion of if* . 

To see the connection to sparsest cut, suppose the cut player has a strategy that guarantees h{H^) > 
T/k, and consider a matching player that plays as follows. When given a bisection (S,S), the matching 
player connects a source s to all a; G 5 with edges of unit capacity and a sink t to all y G 5*. A simple 



lemma similar to lemma 3.4 implies that if the min-cut is at most n/2, then it has expansion at most one. 



Otherwise, the added edges are saturated, and assuming all edges have integral capacities, the flow can 
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be pseudo-decomposed into a matching; the matching player responds with that matching. Then, after T 
rounds, we have either found a cut of expansion one or else routed in G with congestion T . Assuming 
the cut-player forced h{H"^) > T/ k, scaling down by T yields a feasible flow routing a graph of expansion 
yielding a ^-approximation. 
The following theorems appear in [2D]. 

Theorem 5.1 f|20j). The cut player has an (efficient) strategy to ensure, 

exp (— A2(£^ft)) < nexp 



O(logn)^ 

In particular, after T = 0(log^(n)), the cut player can ensure \2{Cut) > f2(logn), yielding an O(logri) 
factor approximation using 0(log2 n) max-flows. 

Theorem 5.2 f|20j'). The matching player can ensure 



h{H')<0 -= -t 



We prove the following. 
Theorem 5.3. The matching player can ensure 



' log n ' 



Theorem 5.3 does not entirely eliminate the possibility of achieving a better approximation in the cut- 



matching game, and indeed it is known among experts that there exists an (i neffic ient) strategy for the 



cut-player to ensure exp(— < n exp{—t / 0{y^log{n))[19 . However, theorem 5.3 says that doing so will 
require certifying expansion via something stronger than A2(/C_ff*)- For example, one could route another 
expander flow H' in iJ^ and certify h{H^) > X2{Ch') /2. Such an approach seems somewhat awkward 
though, as any such flow might as well have been routed in G directly. 



In theorem 5.2 the matching player arbitrarily identifies the vertices of G with a hypercube, and tries to 
keep the dimension cuts sparse. In particular, it is shown that for any bisection (S*, 5*), there must always 
exist a matching that rai ses the expansion of the average dimension cut by at most 0{l/\/d). 



To prove theorem 5.3 we identify the vertices of G arbitrarily with a dense set of points Vi, . . . , v„ S M 
on the sphere 5'*"^, where d — ri(log(ri)/loglog(n)). Letting Wi, . . . ,Wd G be the column vectors of the 
nx d matrix with row vectors {v^}, we show that for any bisection {S,S) there must be a matching that 

raises the average Rayleigh quotient '^^j^^' by at most 0{l/d). 

The following lemma is an easy generalization of one in |20j . 

Lemma 5.4. Let vi, . . . ,v" G i?**, and let wi, . . . , w^; G M" be defined by w^^ = Vxa- Define, 

V'W = V T — 



d- 

i=l 



If all ||w,||2 >L>0, then. 



2 

yl\ 



Proof. 



^t)-i;{t-i) = -Y^ 



1 W/ Cm^^i 



W 

= 1 



1 



dL 

1=1 xyeM* 

Tl ^ 11^^ -^^11' ° 

a:y£ A/* 
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If wi,...,Wd are as in lemma 5.4 and all orthogonal to the all-Is vector, then X2{CHt) < ip{t); the 
orthogonality condition is equivalent to X^^^a; ~ ^- Having fixed such an embedding, when presented with 
a bisection (5, 5*), the matching player aims to match points so as to minimize the average distance between 
matched points. The analysis of shows that for the hypercube embedding {—1,1}'', one can obtain 
ip{t) — -^{t — 1) < 0{l/\/d). The analysis is not constructive; rather, they use the vertex isoperimetry of the 
hypercube to establish an upper bound on the value of the matching problem's dual LP, and then conclude 
a matching achieving that bound exists by strong duality. Their argument also depends on the fact that for 
the hypercube embedding, the squared distances Hvj, — Vj,|p form a metric. 

In fact, the metric assumption is not needed, and there is also no need to apply LP duality. We give a 
simple proof that large vertex isoperimetry of the embedding implies the simple greedy strategy of iteratively 
matching closest points works. 

Lemma 5.5. Let Vi, . . . , v„ e be a set of points such that, for any S C {vi, . . . ,v„} with \S\ < n/2, 
|Ball[S'; v^JI > (1 + Then, the greedy strategy produces M with, 

E l|v.-v,||2<0(nr) 

xyeM 

Proof. Starting with S,S, we pick x € S*, y G 5 minimizing ||va; — Vj,p, match them, and then remove 
them. Repeated application of the the isoperimetric condition implies that, for all S with \S\ < n/2, 
|Ball[S'; i^r]! > min{l + n/2, (1 + ri(l))*|S'|}. It follows that if two sets A, B have size s, there must be 
X e A, y £ B with - Vy|| < 2ty/r for t = [log(i_|_f^(i)) (n/2s)] + 1 < 2 + 0(1) log(n/2s). Then, the total 
cost of the greedy solution is at most, 

E ^y\\^ < O ^ (1 + log{n/2s)f ■ r 

i/2 



<Oln + J log^(n/2s) ds 



< 0{nr) □ 
For the case of theorem|5.2| let G {—1/^/d, l/v^}^ be the hypercube embeddi ng a nd take L = n/d in 



lemma 



5.4 



The vertex isoperimetry of the hypercube implies r = 0{1/Vd) in lemma 5.5 yielding a strategy 
to ensure~0(i) < 0{nr/dL) ■ t = 0{l/Vd) ■ t. 

To prove theorem |5.3[ we choose as per the following lemma, and take L = n{n/d), r = 0{l/d), 
yielding a strategy to ensure ip{t) < 0{nr/dL) ■ t — 0{l/d) ■ t. 

Lemma 5.6. For every d, there exists a set ofn — 0{Vd)'^ points Vi, . . . , v„ G S'^^^ such that X^iLi ^« ~ 
every i<dhasYTx=i^l,i = ^{n/d), and for every S <Z [n] with \S\ < n/2, IBalliS*; 0(l/v^)]| > (l+f^(l))|5'|. 



The proof of lemma 5.6 is a straightforward application of a construction of Feige and Schechtman|llj . 



which we include in appendix O 



6 Final Remarks 

It will be interesting to see if efficient algorithms can be designed for the generalized sparsest CUT 
problem, where we are given graphs G and H and aim to find a cut {S, S) minimizing ^=°gs,B£s^ " ^-^j-^g^ 

H is the complete graph, the problem is essentially the regular sparsest cut problem, up to a factor of 
two). The results of ^ imply an 0{y^log n log log n)-approximation can be found by rounding a SDP similar 
to (jlj , but to the best of our knowledge no efficient algorithms have been designed to approximately solve 
that SDP. 

The boosting step in our proof of theorem |2.3| crucially depends on use of the noise operator. Many 
hardness of approximation reductions for CSPs also make use of that operator in their soundness analysis; 
what is the connection between how it is used in each case? 

Another question concerns the relation between the expander flow SDP and the original "stronger" SDP 
proposed by Goemans. Constructing integrality gaps for the latter is a notoriously hard problem. Might it 
be any easier to construct them for ([ij? If not, can one always "round" an embedding for the dual of ([ij to 
an embedding satisfying the triangle inequality constraints of Goemans' program? 
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A Proof of Lemma 4.2 



For / : M'^ ^ M>o, let ||/||p = E[/p]1/p, where the expectation is over the multivariate standard normal 
distribution. For x G M'', we write y ~p x for a p-correlated copy of u. The Ornstein- Uhlenback operator is 
defined by, 

Tpfix) = Ky^^M{y)] 

Theorem A.l (Borell[9j). Let / : M'^ ^ M>o and -oo <q<p<l. If Q < < [l - p) /{I - q), then 

||Tp/||,> ll/llp /orO<p2<(l_p)/(l_5) 
By a change of variables, lemma [4?2| is equivalent to, 

Pr„[Pru[u e A] < r] < — 



Let / indicate A, and set p = 1 — p,q — 1 — 1/ p. Note q < Q < p < 1 satisfy theorem A.l 

\\Tpf\U>\\f\\, = 6''^ 
Then, Pru^pu[u £ A] = Tpf{u.), and we have, 

Pr[r,/ < r] = Pr[(T,/)^ > r'] 

< WiTpfW.r-" 

,i-p\ i/p 



so 



S 

For T^^P/S < 1, raising the last line to p can't decrease its value. In the other case, the result is trivial. 



B Proof of Lemma 4.9 



Lemma 



3.3 



only uses the fact that for a vector v and standard normal u, (u- v)^ > ri(||v|p) with probability 



ri(l). That property still holds. 

Lemma B.l. Let U G R'^^'^ be a uniform random ±1 matrix, and let v £ M.'^ be a vector. Then, 

Pr [(v-Ul)2 > ||v||V4] > 1/5 
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Proof. It suffices to consider a unit vector v. Let Z = v ■ Ul. Then, 



E[Z2] = E 



Then, for any < < 1/A, we have, 

\ 2" 



Pr [Z^ < A] < Pr [(1 - tZ^)^ > (1 - <A)^ 
Taking i = A = 1/4 yields. 



E 



< 



[l-tZ^ 
(l-tXf 



1 - 2tE[Z2] + t^F,[Z^] 



□ 



The remaining lemmas require a Gaussian-like bound on stretch; for that, we'll use the following theorem. 

Theorem B.2 (Bernstein's Inequality). Let Xi, . . . , Xn be independent random variables with E[Xi] = 
and X^<1. Let cr^ = ^^^^ E[Xf]. Then, for any t > 0, 



Pr 



^Xi>ta 



< exp 



2 + t/3a 



The next lemma says that if k is large enough, we can obtain Gaussian-like bounds on stretch. Note that 
when p = much better bounds are possible, in that even k ^ I works (see [T]). 

Lemma B.3. Let U G M.'^^'^ be an arbitrary ±1 matrix, and let U XJ be a p-correlated copy o/U. Then, 
for any vector v, and any < t < ^fc(l — p^), 



Pr 



Pr 



V • Ul > p(v Ul) + i^l -P^l|v| 

V • UK p(v • Ul) - t\/l -p2||v| 



Proof. It suffices to consider a unit vector v. For each i < d, j < k, let Z^- = v,j(uy — pXJij)/2, so that we 
have E[Z,j] = 0, < 1, and E[ZfA = (1 - /5^)v2/4. Note that. 



V • Ul = p(v Ul) 



i<d.j<k 

Applying theorem B.2 with = k{l — p2)||v|p/4 = fc(l — p^)/4, we have 



Pr 



E > 



2J 



< exp 



2 + t/3a 



proving the first part. The second part follows by applying the same argument to —Zij 
For lemmas 



□ 



4.7 



4.4 



and 



4.5 



we use p — and t = 0{y^log{l/S)), so k — 0(log(l/(5)) suffices. For lemma 

Zil holds 



we use p = 1 — 1/K and i — 0{y^Klog{l/S)), so k — 0{K'^ log{l/d)) suffices. Also, lemma 
for the uniform measure on the hypercube, as Borell's theorem also holds for / : {—1, -1-1}" ]R>o- 
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C Proof of Lemma 



\5M 



Lemma C.l (Feige, Schechtman [11 ). For each < 7 < 7r/2, the sphere S"^ ^ can be partitioned into 
n = (0(l)/7)'' equal volume cells, each of diameter at most 7. 



Apply lemma C.l with 7 = 1/Vd, yielding cells Ci, . . . , C„ with n = exp(0((ilog 0?)). Let y be a set of 



n arbitrary points, each from a distinct cell; for convenience, let us choose V so that V D {—V) — 0. 

Claim C.2. If A C S"*-i has n{A) > a, then |Ball(A;7) nV\ > an; if A C V has \A\ > an, then 
fj.{Ba\\{A;-f)) > a. 

Proof. For the first direction, if fJ,{A) > a, A intersects at least an of the cells, so Ball [A; 7] contains at least 
an cells, and hence at least an elements of V. For the second, if A CV has size at least an, then Ball [A; 7] 
contains at least an cells, so /i(Ball[A; 7]) > a. □ 

Claim C.3. For every i < d, J2vev'^i — ^{n/d). 



Proof Let A ^ {x e S'^-^ 
IJ.{A) > 1/12. Using claim 
X G Ball(v4;7), we have J2vev 



C.2 



ev 

> ^/2jd}. By bounds on the measure of spherical caps (see e.g. [TT]). 
Ball(A;7) n y| > (l/12)n. Then, since x., > y/2/d - 7 > ^/l/Sd for all 

□ 



> (l/12)n(l/8d) > n{n/d). 
Claim C.4. For every A C V with \A\ < n/2, \Ba\\[A; 0{1/Vd)] n X\ > (1 + 1/12)|A|. 

Proof Let AC V have \A\ < n/2, and set Ai = Ba\\[A;-/], A2 = Ba\\[Ai; 4/Vd], A3 = Ball[A2;7]; the goal 
is to show IA3 nV\ > {I + 1/12)|A|. Note claim [02] ensures fj.{Ai) > \A\/n. If ^{Ai) > (1 + l/12)/2, 
then /i(v42) > (1 + l/12)|j4|/n. Otherwise, by the isoperimetric inequality on the sphere (see e.g. [17!), 
K^2) > (1 + 1/12)m(-4i) > (1 + l/12)|A|/n. By by claim [021 IA3 nV\>{l + 1/12)|A|. □ 

we let V — V Li —V. Clearly J2veV''^ ^ claim 



Now to prove lemma 



5.6 



C.3 



still applies to 

V' , so it remains only to argue claim C.4 still holds for V' . Let A C V' have \A\ < \v'\/2 — n. Let 
A = A^ U A^ where A^ C V and A^ C —V, and suppose \A^\ < \A-\ (in the other case, an analogous 
argument apphes). We consider two cases. First, if \A+\ < \A^\/2, then ju(Ball[A_; 7]) > |A_|/ri, implying 
|Ball[A_;27] nV\> \A-\/n. Therefore, |Ball[A;27] nV'\ > 2|A_| > (4/3)U|. Otherwise, \A+\ < n/2 and 



1^+1 > 1^1/3, so claim 



C.4 



implies \Ba\\[A+■,0{^/^/d)]nV\ > (1- 
V'\ > (1 + 1/12)|A+| + |A_| > (1 + 1/36)|A|. 



1/12)|A+|. Therefore, \Ba\\[A;0{l/Vd)]n 
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